rOCOHEMT E£SOH£ 

2D 125 OOH 95 CS 501 «10 



IHS2IT3TI01; ' 

P05 DAI2 
SCTZ 

ED5S F51CS 
DZSCEIPTOHS 



IDEKTIFJ2HS 



Buaegeair^ Hobert 2. 

Linguistic Factors in the lissBssnent of Speech Sound 
Biscriffiiaation. Peport So. 52. 
Southwest STegional La^ratory for Hducatiosal 
Sesearch afid Sevelopaenx^ los Alanitos, Calif. 
^Office of 2aucation (DEBii) , SashingtcUr B.C. 
5FEL-TH-52 
25 Apr 75 
27p. 

Mr-$0.83 HC-$2.06 Plus Postage. 
*Auait^y Discrimination; *Aural Learning; 
Saucational Besearch; Zleaentary Zducation; 
=i^Ll3iguistics; *L-isteiiing Coaprehension; ^Lisrening 
Tests; *I5easurea3at Instruaents; Technical Beports; 
2est Construction 

Southwest Regional Laboratory; S'rfBL 



AgSTESCT 

2he results of recent studies are presented which 
show thar traditional auditory disorijainatior tests for children ^ 
which require the e'xacinee'-to distinguish one speech sound froa 
another are ill-conceived and consegoently of little practical value. 
Linguistic variables requiring attention in designing useful speech 
scurd discriainaxicn instruments are discussed. (Author) 



* Documents acquired by S2IC include many informal unpublished * 

* materials not available from other sources. EaiC makes every effort ♦ 

* tQ, obtain the best copy available. IJevertheless, items of marginal * 

* reproducibility are often encountered an^ this affects the quality ♦ 

* of tfie microfiche and hardcopy reproductions 2RIC makes available * 

* 7ia the ERIC Document 2eproduction Service (EDSS) . 2DBS is not * 

* responsible ior the quality of the original document. Seproductions * 

* supplied by EDBS are the best that can be made frca the original. * 



i -J 




SWTIL 



SWRL EDUCATIONAL RESEARCH AND DEVELOPMENT 

TECHNICAL REPORT 52 - 

APRIL 1975 ' ■ . - 



« 



LI!;GriSTlC FACT».iR5. i:; THE ASSESS!!E:n- OF SPEECH SnU.MJ DISCRIMI!«\TIO:: 
Robert E- Ruds-gisair . • ' 



ERIC 



ABSTRACT 



The results of recent studies^^'e presented which show that tradi- 
tional auditor-/ discrinination tests for children whicl^ require die 
examinee to distinguish one speech soiHid froa another are, ill-conceived 
and consequently of little* practical value". Linguistic variables requiring:: 
attention in designing useful speech sound discrimination instruments are 
discusser * ' 
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Li:;CUlSTIC factors Hi THE ASSESS«E!n" OF SPEECH SOUCT DtSCRIMirJATlOS 
•Robert E- Hudegeair ' . 



Auditoty discrimination tests are designed to asses's children's 
ability to discriminate one speech sound fron another.- The najor 
applications for such tests in educational .and clinical settings are 
hased on correlational studies of iferfomance involving auditory dis- 
criiiination test scores, measures of articulation, and early reading' 
achieverent. labile 'the liteifature is anbiguous, the nDtion has none-* 
theless prevailed that inadequately developed auditory jdlscrfiainat ion 
is a causal factor in articulation disorders as veil as reading dis- 
abilities. In this paper the results of 'recent ^studies are presented" 
to shoi: that traditional auditory discrimination tests, on vhich the 
correlations cited are dependent, are basically ill-conceived and con- 
sequently of little practical value. 

- ^ * 

The uncritical acceptance of traditional audxtory discrimination 
assessment procedures by the educational and speech research communities 
-has recited in a grossly underestimated appraisal of normal audxtory 
perceptual proficiency in school-ready children (see, e,^g., Beving 6 
Eblen, 1973). More serious, however, is the gap created in the data 
base on ^chich models of language acquisition are founded. Sensitive 
measures of speech sound perception are necessary to adequately diag- 
noyie specific* language problems in children and to enhance our under- 
standing the nonaal course of evehts in the development of psycho- 
linguistic processes. 
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TRADXTIOm TESTS OF AUDITORY DISCRIMINATION 

The prototype for standard tests of auditory discrimination was 
devised by Travis and Rasmus (1931). Hie^test is comprised of J5l 
pairs of contrasting nonsense syllables (e.g., Jte/^/da/) and 35 non- 
contrasting pairs (e.g."^, /ta/-/ta/). Subjects arc asked to respc^nd 
"same" or "different" to each pair; as it is presented orally. Subse-, 
quently the Travis-Rasmus Speech ^Sound Discrimination Test was used 
in a study by liall (1938).. Both studies compared speech sound dis- 
crimination performance of normal subjects with that of articulatory 
defective subjects. Both sampled several age levels from kindergar- 
teners to adults. Travis and Rasmus found that experimental group 
performance on the discrimination measure was significantly poorer , 
at all agl^ levels. Hall found fhe opposite. 



• "The»-unwieldy list of . Jjnost*all possible English speech sound 
contrasts has not been used in subsequent testing. Since the .proto- 
type experimertts, speech sound discrimination tests have been confined, 
almost exclusively,' to syllable pairs that represent minimal phonemic 
differences* Minimal pairs are understood to mean^those pairs of 



speech sound's- that -contrast with f *igar<I to one. or sorctices tvo ^rtic- 
l3tory featiJreH\ The transition to. the restricted set of contrasts 
was rad^s gradually, hovever,*5s evidenced by the original speech sound 
discoaination t^c designed-*&y TenpAin (1943).* Her test consists of; 
200 itjms, .68. consonant' contrasts in syllable initial position, 68 in 
syllable final* ^r sedial position, and -64 non-coU4.rastive syiiable 
pairs- This lengthy .test was coapared t-o a "short test" of 70 itens 
and faYurable correlations were -denonstrated. Tnusv Tenplin concluded^ 
that" the short test vas an adcep table assessment instrument • The 
70-iran test, consisting 'of 51 contrasting and 19 non-contrasting 
syllabie^J becane the first test to fccus on riinirai pairs. Although 
Tc-nplin's test involves nonsense syllables and relies on sane-different 
jud^nts, it differs ^ several respects fron the Travis-Kasnus .model. 
ISiil^ Travis and Rasnus used only syllables, vhere the voweL:vas . 
•/d/, to test consonant discrininatioa, Tenpiin used-CV, VC, and VOf 
^/UablBS. She also varied the vovel, using three vowels and a • 
"'diphthong. in constructing her syllables.* Furthermore, no vowel con- 
trasts are presented in the Tenpiin test. In addition to Templin's 
own work, the short t^&t- discussed here and shorfer 50-iten t^t 
(Tenplin, 1957) have been used by several other investigators to 
studv the-' relation between articulation abiliXies and phonenic dis- 
crimination abilities; XKronyall & Diehl, 1954; Cohen & Diehl, 1963; 
Aungst i Frick, 1964;' Sherman. & G^itb,^ 1967) . 

The most frequently jised^kudifcory "discrimination test (in 
correlational studies) has been irJie Wepman test of, Auditory Discrim- 
ination (Wepman, 1958). We^pmanJs test contains 40 real-word pairs, 
10 "sane" pairs, and 30 "different" pairs. Cfely error? on "different" 
pairs count as discrimination errors. "Sane" pairs serve to keep a . 
subject honest and thereby constitute a theck on the validity of t]?e 
test (Wepnan, I960). Both. members of a contra$ting pair are equated 
for frequency of occurrence according to the Thorndike-Lorge (1944) ^ 
count. The Wepman ^est pi Auditory Discrimination is, like Templin*s 
short test, lasuie up of nrinimal pair contrasts only. But Wepman 's 
test, presents a restricted sample of all possible minimal pairs.. All * 
consonant contrast^ on* the test represent contrasts among stops and 
contrast^; ampng frictatives with regard tcf place of articulation. In ^ 
addition to consonant contr?<;ts (in initial and final position),, four . 
vowel contrasts are also included. Wepman (I9C0), in a preliminary 
report of the findings from studies using his test, concluded that^ 
there is^a de£inite relationship between fauljty articulation and poor 
discrimination,, bujt no 'data were presented.^ 

The Wepman" Test haa been .used i-n attempts to define relationships 
between auditory discrimination abilities and articulat-ion proficiency 
(Prins, 1963), beginning reading achievement (Christine & Christine, 
196.4; S'ivaroli & Wheelockj. 1966V Blank, i96;8), and differences in 
dialect (Deutsch, 1964;- Coll^r^ Coleman, & Schwartz, 1968; Deutsch, 
1972; Elenbogen & Thompson,^ 1972) . It is not inteilded in this paper 
to discuss the meritSvof these studies or the interdcpendencies or 



correlations aziongA^arious •llngiaistic proc*2Sses such as thosa cited* 
JSliv: literaturii is r^^farenced only to illustrate thi^t. traaitional 
auditory discrininatioa ^tests, in addition to their widespread ciinical\ 
'o^e, are heavily relied upon in educational and linguisi:ic research^ • 
It spears, hc-i-ever, that the overwhelning concern with correlation:? 
betv^en discrimination ability and various other language piTocesses 
has iJistracted the research coTrnmity fron addressing funSanental • ^ 
•'juetiticns abDut speech sound di*2crir2ination per se. The long-s landing 
assmpticn has been that since -speech sound contrasts are presented, , J* 
spee*:h sound discrinination is oeing tested. The resultjs lof several ^ 
rccv-nt^3tudie*i'cair this 'assumption into question and^enphasize the 
need to reevaiiaatc: prevailing notions of speech sound discrininatioij. 

^ COCNTTIVE FACTORS , n; AUDITORY DISCRIHINxYTION ASSESSMEIJT 

, Trie sanie-dif lerent technique typically, used assessing speech 
Si-^und ^discri^aination lias been critized on several grounds. .Vellutirio, 
J}€tSetto^ *and Steger (1972) hypothesized that discriiaination tasks 
vith unequal 'response alternatives have a built-in response bias 
because of an intri*isic tendency to employ response categories (e.g., 
'*3ace*' and "di£f erent *' ) with equal frequency. These investigators 
tested this hypothesis using the V?epnan Auditory Discrimination Test 
vhicfi *is conprised of 3D "different" and 10 "saae" items. * Half of 
their sanple of fourth, fifth, and sixth graders were administered 
tiie standard Wepman and xhe other half were administered a revised 
version oi the test in vhich same and different items were equalized. 
The group presented with equal numbers of response categories made 
significantly fewer errors, leading Vellutino et al. to c'bnclude that 
"the iv^epman and ins truarents with a similar forma t^^may be characterized 
by a substantial degree of vatiability unrelated'^to individual dif- 
ferences in auditory discrimination..." (p. 255).. In an earlier 
study, Briere (1967) also reported finding a bias , for responding 
"same" when the stimuli were different. Findings such as these raise 
serious questions about the interpretations given to error data 
gathered in the context of the traditional testing format. 

Further criticism has been leveled against the use of the same- 
different paradigm because it demands an operational understanding 
of ^the- concepts "^ame" and "different." " Beving and Eblen (1973) 
tested" 30 children 4- to 7- years-old to evaluate tKe ijifluende of 
the concepts "same" and "different" on speech sound discrimination , 
performance at various age levels. Thre^ groups, 4-, 6-, and 8-year- 
olds, were given a 25-item discrimination test under the same-different 
format. Three days later, the same children were given the same items 
and asked to repeat the syllable pairs in lieu of responding "same" 
or "different." From the responses in- this second task, same-different 
judgments were inferred by recording whether a child reported two 
words that were the same or two words that were different- No regard 
iias paid to repetition accuracy per se. Results showed that while 
all three groups exhibited significantly different error rates on the 



judgi^eiit test, perfomanc^f on the ^repetition test vas *^quiValenc 
across the three groups (error rate = approximately 8t). But only . 
f or j:he "i-year-oid group vas a significant diff erence found between 
the judgment scores and the* repetition sc^ores, judgment errors being 
significantly core nur:erous.. It was C4?n:cluded that while ^-yeafroids 
were able to discrininate among speech sounds on a par with 6- and 
8-year-olds, as shown by their performance on the repetition task, : 
they appear to be unable to work with the concepts "saae*' and "different." 

In an earlier study^ Blank (1968) sought to detemine Twhether 
difttirences la auditory discrinination scores between good and poor 
^readers were a^unction of failure to- discrininate or a failure related . 
Lo^ the ccnplex cugnitix'e processes denanded b> the sai2e-^if ferent task. 
Her study consisted of three experusents. In the first experiment, 
two group^of 7-year-0lds, classifi<;t> as good 5nd poor'readers, exhibited 
differential scores on a sane^dif ferent auditory discrimination task. 
•!*ine children in the original- group of poor readers had to be eliminated 
because of a failure to understand the task. With regard to this point. 
Blank suggested that "the need to make* a cognitive judgment 'of ^same- 
different' posed a problem for the retarded reader which went beyond 
tlie perceptual dez^ands of the task." Thus, the second experiment was 
designed to eliminate intervening cognitive judgment of "same- 
different" by having t^^child report directly the word pairs he heard, 
as in the Beving and Eblen study. Ulien ";>ame-dif ferent" judgments 
were inferred on the basis of what the child reported, the good readers 
still exhibited significantly lower error rates. This in in accord 
with Beving and Ebleri's finding that 6- and 8-year-olds showed no 
imnrovement when repetition* errors were compared to judgment errors. 

While it appeared that Che cognitive judgment of same or different 
d.id not in itself cause a problem for the poor readers, an analysis of 
the kinds of errors liiade by Blank's two groups still indicated that 
task strategy factors rather than percept^ial abilities accounted for 
differential performance between the groups. Poor readers^, _in the 
repetition data/ showed an overwhjalming tendency to perseverate pairs 
(i.e., match the second member of th6 pair to the first). Good readers 
did not exhibit a strong tendency in this regard and therefore made 
fewer errors. The highlfrequency of perseverations suggested a lack 
of attention across jJairl members and led Blank to her third experiment. 
This experiment was designed to remove the conditions lor p.erseveration 
bias and still test ac-curacy of perception. Thus, the children in 
Experiment 3' were required to repeat single words. The first ^'members 
of pairs employed in Experiments 1 and 2 were presented, one at a 
time, followed by the wotds that were originally second members. Each 
group exhibited* a mean ri te of correct repe4:ition of^ 85%, suggesting 
that the'groups are not c if terenjtia'ted as a function of speech sound 
perception abilities. Ccgnitive demands of the task, extraneous to 
speech sound perception, appear to account" for differential performance 
on a ^measure of ppeech sound discrimination. 



Of her studies iipve also, questioned the value o£ standard speech 
seund discrinination tests as mtasnres oi perceptual f>rocesses. Ihcs* 
teSiCs -are typically administered ii^ a single test session and are pri- 
suned to be 30 straightforward and- siriple tkat. training procedures 
*are' unnecessary. Yet, in the case of the Wepnan Test of Auditory 
Discriaination, it has been shown that significantly better performance 
. occurs, when o-year-olds repeat tha-test (Rudegeair & Kanil, 1970).. 
Berlin ^nd Dill (1967) also reported usprov^ed perfornance when feed-^ 
back and positive r^in^olrcezient vere provided during a second admin- 
istration of the kVpnan test. In spite of overwhelning evidence * 
that traditional test procedures preclude sensitiv<^ measures of per- 
ceptual abilities, these procedures continue to be tised to diagnose 
auditory perceptual deficits. The notion that nothing cao be sirzpivr 
than asfeing 'the child if two spoken vords are the sane or different 
prevails-. But, as the Blank' study demonstrates, a simple initativ^ 
articulation task provides a puifer measure of perceptual accuracy 
than does the test requiring --sane-different judgments'. 'The imitation 
task, we can assun:e, is not trustworthy because ciisarticulations and 
nisnerccptions are confounded. Crucial to thg very -definition of an 
auditory discrinination cask is that It Bypasses ar'ticulatory responses 
whit;h in themselves ^annot be used to substantiate perceptual inao'equa*. 

LINGUISTIC FACTORS IN AUDITORY DISCRLMINATION ASSESSMENT 

E>J>£Ri;-lENT 1 : MINmLLV MiD MAXIILVLLY DIFFERENT CONTRASTS 

Kilile it is not immediately necessary of even possible to isolate*" 
each and every process or skill that plays a role in the t^d;itional 
paradigm of auditory <iiscrimination assessment, it is important to 
know to what extent perceptual confusability of speech sounds is 
actually being measured. In an effort to' establish a means for making 
such an appraisal, an experiment in speech sound' discrimination was 
conducted using, as test items, all possible English consonant contrast 

While it is usual to test only minimally distinct consonant con- 
trasts-, it was hypothesized that maximally differenr: contrasts, i.e», 
those' t4iat differ .by four, f ive, and six features, can serve as an 
appi;opriate^*conu:oI condition to support, the claim that perceptual 
conf esability^ has been measured. Since consonant conf usability is, 
in general', a function of the number of feature differences involved 
in a contrast (Tikofsky & kclnish, 1968; Graham & House, 1971?, any 
test sensitive to perceptual confusion should yield more errors on 
minimal consonant contrasts than on maximal consonant contrasts. If 
subjects' error rates are equivalent acros% minimal and maximal 
categories and at the same time beyond the range of chance, it would 
seem safe to conclude tKat such error rates are due to factors other 
than inadequate speech' sound olscrimination ability . fo maintain the 
traditional procedure of presenting two stimulus syllables for comparis- 



and, at the sane tiae, avoid the problen of saae--dif ferent judgments, 
a ^forced-choice, r:ntching-t?o-sanple (A-B-X) procedure w5s employed. 



Method • & 

Participants . Twenty first-grade pupils, randomly selected- frorn^ 
three classrooms. in an elementary school^ and eight pupils ^rom a_ _ 
4-year-old cl^ss at a i?jce^chool served as ^s. All Ss were liormal- 
hearing, songlingual speakers of Englirh* 

Procedure. The test' sessions were held with individuals in a , 
soundproof experimental trailer set up os .the^ school grounds. Test * - 
items, consisting of rwo contrasting syllables, ^were presented in a 
forced-choice',. ma tchi^ng-= to-sample (A-B-X) procedure- Participants 
were seat^ed m itlyav between^ he, i:wo speakers of a stereo ffape recorder. 
a' warning signal (1000-cycle tone) followed by the firstmember of 
the contrast pair (syllable A) wa^ heard over the left speaker. One 
second later the second member of the <?5ntrast pair (svllaple B) was. 
- heard over the ri^ght speaker. One second later "who said X?" (where 
X is either A or B) was heard stereophonically. Three seconds eldpsed . 
before the- warning signal initiated the next trial, ^Subjects res- 
ponded by pointing to one of the speakers. The ^experimenter, who was ' 
seated *behind\S at all times; recorded all responses immediately on 
•prepared data sheets* ^ 

Stimuli . All consonants of- English, i.e., phonemes, whose ^featurfe * 
'specification includes [-f- consonantal] and [- vocali>p], w^re paired 
with one another, each in combination with the vowel /a/,^ for a total 
'of l7l CQtitrasting syllables. For reference,, all contrasts categorized 
by distinguishing features* are presiented in Appendix A. 

^ . ' ' ' ' 

' All contrasts were in the syllable-ii\itial position witK the 
exception oi those -involving Hi or /o/. In these cases the contrast 
occurred in syllable-final ^ositijon' (e.g. , /aV vs /ao/). Nine d^ramy 
Items were constructed from liquid and glide combinations to make 180 
pairs. These were randomly .divided into six lists, 30 items" per .lisfe,*'^ 
which were' tape recorded in 5 sound studib by the experimenter. Because" 
a pupil only responlied once to each item and because it is necessary to 
counterbalance the^ occurrence t)f a given item between A-BtA and A-B-B 
instances, ^twb* sets^ of tapes were made. Any item appearinjg in A-B7A 
focm in one set of tape^ appeared in A-B-B f^m in the other. Half the • 
pupils in an age group got one set of tapes aW the other half the 

'Other set* Pupils were tested over 7 successive school days, 1 day of 
training followed by 6 days of testing. Training was achieved by selec- 

\ ting one of the test tapes at jrandprn and leading the child through the 
actual test conditions with appropriate instrfctions and feedback. On 
test days, the experimenter merely said "good*af<er each pointing- 
response by the child. , ^ , * 



Results . ^ * , 

' For the analysis of the exyror data, the te^t pairs '(171-.consonant 
contrasts) were categorized according to the'nunber.^of features distin- 
jguishing the contrasting consonantsC Since consonants can- contrast ^ 
one to six fe^atures Csee AppendjLx A), 6 categories' resulted . Figure 1' 
displays the mean error rates for each of the age groups as a function 
of the number of contrasting 'features/^ 
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Featyire Category^ 



Fig. 1. Mean error rates plotted for 4- and 6-year-old 
groups according to the number^ of features distinguishing 
the consonants in a ccrntrast 

For ihe 6-year-olds, the overall error rate (171 items) was 12.1%, 
with a range among feature category means from 9% to 16%, and a range 
among pupil means from 3% to 28%. For the 4-year-Qlds, the overall 
error rate was 19% with a range among feature category means from .9% 
to 35%, and a range among pupil means from 12% to 27%.' The source 
table obtained from a two-way analysis of variance (subjects within 
group [h4- or 6-year-old;^ ] by fd^tur6 category il through 6]). is ^ 
presented in Table 1. , . t 



» ^ Four-year-olds made significantly more errors than 6-year-olds. 
Performance on discriminations at different levels of feature differ- 
ences was not equal, and this inequaiity. interacted significantly with 
age group. These three results, cts shown by subsequent paired com- 
pa^'isons, can all be accounted for by the same datum, viz., that the 
A-y^ar-old group exhibited significantly more errors^ on contrasts 
distinguished by only* one feature. Paired comparisons were mad^ among 



Table 1 



Source Table for 2-Way M0?A: S^s within 
• ^ 2 Gro\ips 2^ 6 Feature Categories 
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*p<.05. 



cell means according to jjrocedures for unequal group sizes presented 
in Winer (1971, p. 603). These tests show no reliable differences 
among feature category, means for the older group. .The only reliable 
difference found for the younger group was betwe^ the first feature 
category (where consonants contrast on only one feature) and all 
others. Furthermore, 'the only category in which the younger group 
made significantly more ertors than the older, group is the first-. In 
'other words, both main effects and the interaction,&f feet revealed by 
the two-way MOVA are a function of the 35% error Yate for A-year-olds 
on minimal-pair contrasts (see .Figur'e 1). 

■ m ■ , 

Discussion 

" The finding that only one-feature contrasts cause significant 
discrimination problems i^ reinforced by the results of several pre- 
vious studies. Tikofsky and Mclnish (1968), in an experiment where 
all possible contrasts among 15 consonants were tested on 7-year-olds, 
concluded "that as the number of feature difference's increases, error . 
rate will 'drop. -In fact, any increase beyond the one feature difference 
assures almost perfect .discriminabildty in terms of this experiment 
(p. 62). Graham and House (1971), who tested children ranging m age 
from 3 to 4.5 on all possible contrasts among 16 consonants, elicited 
error patterns .as .depicted in Figure 2. These authors report that 
an^ans^lysis of variance and Newman-Keuls procedures showed that the 
error rates f or. o'ne- and two.-feature differences were reliably dif- 
ferent while 'two- through six-feature contrasts did not differ. 
Graham and House concluded "that discrimination improved with the 
addition of a second feature, but did not improve significantly with 
further additions of featur;es" (p. 563). 



In the' present study, two distlnct^age groups were subjected to 
the same test format* and testing conditions* ."Previous studies either 
tested agei-group, differences and ^ignored featxiral analysis (Templin, 
1957; Deutsche 1964) or vice versa ' (Tikof sky McInisK, 196'8; -Graha^n £c 
House, 1971) * The* significant interdiction , between ^age group, mid 
feature, category reflects t^e fact that the 6-year-^olds, a^^- group, ^ 
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* ' Fig 2* Mean error rates from Graham and House (1971)^ 
^s :according to the number of feature^s distinguishing 
the consonants in contrast/. ' ' ^ ; ' 

have no more difficulty discriminating minimal contrasts than contrasts 
distinguished by ^f our, five^^r six features. In other woYds, these 
data suggest ^ tbat^ they have no discrimination difficulty at all. Thfs 
conclusion is based on the s'lipposition that perceptual confusion arrears 
will be distributed unequally across feature difference categories 
(tiie lower the number, .f' higher the error rate) ^ while^ error- rates 
du,fi to 'task. constraints ve*g* /Qtfenfrion pfObiemsoor storage and 
retrieval problems) will be distributed equally amojrig feature ^ifferenc/i 
categories. . > * - 
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At two, three, and four-features different, the performance of ' , 
4-yea,r-olds wa^ not significantly different .from the performance .of 
6-year-olds, but it did exhH)d?t**a predictable p^ttern-^errqr raEes 
in Jthese categories we^e higher for ^-year-olds. At f ive-f ea^turJES 
different, however, feher6 was clearly no difference in pefformsftce 
between the gi^^oups. Contrasts thaf are six~fe£ftures. different are 
npt worthy of discussion since there are not enough of them to justify 
any comparisons. Five-feature differences appear to represent a clear 
indication of baseline ,f)erformance. The data suggest thdt error rates 
at this level are more likely attributable to task constraints *than 
perceptual confusion. Sii^c'e no reliable meaning can be atti?ibut.ed to 
scores achieved on currently existing auditory discrimination tests," 
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the needs of those who rely on such tests can be served by suggesting 
sieaningful ways to inprove test design. With little apparent trouble, 
control iCens fron naxinal contrast categories can be inserted into 
auditory discr|.niination tests. Assunptions about perceptual confusion 
Vprobleas can be tested i>y appropriate coapari^ons between error rate^ 
on siininal and naxinal pairs. 

The prcisent approach suggests ^a different v-iew of "auditory 
discrinination deficit": than is found, fbr example, in the Wepiaan 
.^lanual of Directions (1958) . There it is asserted that a 5-year-old 
vrhc» hakes nore than six errors (oo "different" items) is revealing 
"inadequate developnent ." Six-year olds vho*inake nore than five errors 
are placed in the same category. Children 8 years of age or older are** 
allowed three errors before being judged deficient in auditory discrim- 
ination skill. Based on the data presented in Table 2, it is more 
likely that any Snadeguacy lies in the criteria established for the 
Wepnan test. In Table 2, error data fTOm several studies ^n which 
the.Wepaan test was used are presented. The error data contained in 
the table reflect scores achieved by ^control groups only since non- ^ 
norzial behavior is not at issue. ' r 



Table 2 

Error Data for Control Groups fron Various Studies in whicii the 
* Wepnian Test of Auditory Discrininatic^n Was Administered 



Study 


:."o. 


of S^s 


Age Range 


Mean No. 
of Errors 


' Wepman 
Criterion 


■;leni:.y4eS:7 (1972) 


- 


30 


- 

5-6 


5.2 " 


6 


^£vjr-»li tt iviftnrlctck 
(1966) 


: 


120 


5-6 


'9.3 


6 


■ ai^r al. (1968) 

• 




128 


6-7 


9.8 


_ / 

D 


Prins (1963) 

* ** 




19 


6-7 


5.6 


5 


5'?jdeseair £< Kamil - 
(1970) 




12 


'6-7 , 


- 

5.C . 


5- ' - 


iK-utsch „ (1964) 




' 8 


' . 6-7 


7.8 


5 


^trlin £ Dill (1967) 




11 


8-9 


5.9 


3 


Dfutsch (1964) 




8 


8-9 , 


5.5 


3 


I)u(ft-;ch (1964) 




■ 8 


' 10-31. 
f 


* * 

6. .9 


3 


Vellutino et al . 

(1972) 


/- 


60 - 


10-13 


- '3.03 


■ 3 - 
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Ajje of Ss, nunber of Ss and jzeart niznber of errors (on "diff^^rent" 
pairs) provide enough* information to conclude that the average chJle 
in each group fails to neet the ti^tst'*^ ovn standard for adequate 
develcpaent. With the exception of the Elenbogen study, the cean 
nunber of errors for Ss in each of these studies exceeds, the value 
designated as a cut-off point for the age group in Question* ' 

The Wep^n'test is to'pical of currently used tests in that ail 
rely on a global discrixrination scoYe- These tests are apparently 
deeiied useful because they are easy to adninistec-ajid score. A cc--.- 
pronis^s between an aabiguous global score and complicated subsets ax 
scores appears attainable with a list that conbines ainii3al**and^ . 
contrasts. Discriiaination difficulty or "deficit" in this context i.. 
represented *by the difference in perfomanceTjetveen the tvo itezi V j*. 
Not only is such a test relatively easy to score, but it offers snz:e 
guarantee that the ceasij^e obtained is a function of perceptual c^nii 
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EXPERIMENT 2 : ^lEXllVlGFVL ATI?) ^^EMIVAGLFSS STIMl'LUS SYLLA.^;LES 

While it has been firmly established that extra-lingufstic Jacti r 
play a part in speech ^sound discrimination test performance, little 
attention has been paid to the conpstirig linguistic factors inherent 
in discrimination tests. In addition to the' speech sound conpariaons 
required in sam^-different and A-B-X tests, the neaningfnlness of tht^ 
syllables used to present the contrasts appears to have an important 
influence on test results. In the view of some investigators, auditor 
discrimination, can be measured by^prearenting, in succession, two 
familiar words that differ with regard to one of their constituent 
phonemes and asking the child to judge then "sane" or "different.** in 
opposing view it is held that meaningful words are not appropriate fur 
such a test because those children in the sample ^familiar with the 
words have ^ advanjtage over those to whom the words are less faniliiir 
Thus, it is argued, a more sensitive measure of speech sound discriir^i- 
nation can be achieved by using nonsense syllables because they arc 
a means of controlling for familiarity. Implicit in this strategy, 
however, is the notion that children are capable of processing speech 
sounds out of context — or out of any meaningful context « This suggest" 
that speech sounds are, in some sense, functional perceptual units 
which can be processed in and of themselves. In this experiment an 
attempt was made to test this assumption because of its significance 
for the desxgn of auditory discrimination tests. In addition, whether 
cKildren can or cannot 4)rocess phonological data as such has a bearing 
on the linguistic nature of the discrimination task involving nonsen.*^e 
syllables. , Previous studies comparing children's, ability to dis- 
criminate sounds in meaningful versu§ nfeaningless contexts have been 
inconclusive. ^* 

# f 

' Sapir (1972) hypothesized that Vchildren's performance in audii»'^r 
discrimination is significantly better when words rather than nonsense 
syllables are used in the task." Yet no nation of why this might be 
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crutr V3S provided- H«r data froa 150 children ranging in age froa 3 
to *^.^: shoiud no reliable differences betveen the tvo stisulus types- 
5apir su^gestt24, by way of explaining this result, that children aay 
not be ^fcie to process ceaning when the vord is separate froa its 
. ntext, iaplving that ali itens on the test were, in,effect, ceanxng- 
less/ This suggests that if real words were found easier than nonsense 
i;yLla'fal.iS, the "aeaning" factor would account for the results- 

Perozzi and Kunze (1971) intended to deaonstrate that word and 
nonsense tests wen equivalent in their aeasureaent properties so that 
fuK' of svUable tests, though apparently artificial, could be 
iv^tiiied. The advantages of the nonsense-syllable tests, according 
t . Perozzi and Kunze, lie in the potential they offer for constructing 
,1 t<.st with controlled degrees of difficulty. x\pparently, this potentoal 
ii 3 function of the fact that nonsense syllables a libw testing of a 
wide range contrasts, whereas real words aUow testing of only those 
3inia.U iiuund contrasts that actually occur in the lexical inventory of , 
rhildrtn. 

In diKcussini; why the two stimulus types nay elicit different 
ili^crininirion perlornance, Perozzi and-^unze cite Weiner's (1967) 
argunent chat paired-word tests are less abstract than paired-syllable 
tests because the paired words signal a difference in meaning as well 

a difference in sound- Neither Veiner nor Petozzi and Kunze discuss 
th'j inplit!ations of this two-level signalling for a psycholinguistic 
riodel of sound discriminaticn behavior. Perozzi and Kunze 's 30 
kindergarten pupils, who ranged in age fron 5.3 to 6.5, showed a jnean 
.A correct responses on the Syllable Test, compared with 73.6 on 

the T.e<;t. No test of significance was reported. The Pearson- 

Fr'ducI correlation betweeu the two tests was 0.873 tp<*01) and the 
investigators cuncluded that the performance on one test would be 
highly predit-table from performance on, the other. 

labile both the Sapir,, and Perozzi and i^unze studies show no reliable 

till f»/rvnc.-3 between words and nonsense syllables, studies by several 

inveiiti^^ators confirmed such a diffes:^nce. Two of these. Blank Q-968) . 
and tCatz and Deutsch (1967), tested Hebrew versus English-^timulx and ^ 
thus -represent more than the simple ^^ord versus nonsense experiment. 
The intruduction of foreign sounds Represents a confounding, factor. 
Elenbogen and Thompson (1972), however, used the standard Wepman Test' . 
of Audi^ciry Discriminatioa j(1958) and a test they called the "Distorted 
"Weprnan" in which the real words in the original Wepman were distorted 
into nonsense syllables while the same contrasts were maintained. In 
a repeated measures design involving a group of suburban, Anglo' 
children of middle socioeconomic status and a group of urban Blacks 
children from a low socioeconomic range, these investigators found 
that error r^tes were no different for Anglos and Blacks on the dis-- 
torted Wepman ?and Blacks on the standard Wepman. But error rates for 
Anglos on the standard Wepman were significantly lower than the other 
scores* The investigators concluded that middle class children have 
difficulty discrimin/lting minimal pairs when meaning is removed from 
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the carrier syllables. They further ^u^^j^vst that the "viicabuUry 
factor" is an aid ti:» pcrfjornance tht- Kifp^J^ T%:st vi Audit>-»ry 
Biscrinination- 

ElenbojRen and Tl^cnpson exp-ind on tuis thene to nake- tne./p^dnt 
tliat real vord pairs "would prox^ide clues to children vith extent Iv^v 
language backgrounds. This would detract i'ron accurate assessntnt 
sj£ auditory discrininatic»n and would .aff*2fct tlottzs %'hen childrc-n viti: 
i^aried language t^xperience are evaluated vit!i" the saz;e noms" (p. JII *- 
Tills clai3 that real wrd pairs lead to an inaccurate profile of ^i^^.:-<.. 
discrimination ability vas echoed by Peruzzi and Kunze vhen they 
suggested "that i'-cne youngsters nay have an unforeseen adv^antagc owr 
other::> vhen taking paired-vord test's" (p- iSU). 

It is ironic that the argument that neaning enhances discririir>abl 1 
has been nade nost /orrefully in the reports of two studies in w3^i h 
difference between real and nonsense conditions *?as observed. If. |,he 
argunent itself has s^ubstance, the obvious conclusion is that it dot-^ 
not apply in the case of the children tested by Sapir, and Pero^^i ani 
Kunze, perhaps because, at 5 and h years of age, these children were 
capable of successfully, executing any auditory discrimination task. 



Experinent 2Ar Four-Year-Qlds 

The present study was intended to investigate relative discrinir*- 
ability of contrasts in neaningful and nonsense carrier syllables for 
children younger than age 6. y - 

Method - ' ^ 

Participants . Ten pupils from a 'i-year old olass at a public 
preschool were administered the test. All children were monolingual 
Anglos with ^o known hearing loss. 

Procedure . The procedure followed in this experiment was 
i^en^lcal to that used in Expeifiraent 1, i-e. , all syllable pairs 
were presented according to the A-B-X format. 

Stimuli . Test items were 16 consonant contrasts all ol which 
rej)resented a difference in only one articulatory/^coustic distinctive 
feature. Test items are shown in Table 3. Since each list contained 
nonsense-syllable and real-word representatives of each contrast^ 
there was^-a total of 32 test items per list. Real and nonsense items 
were randomly mixed in the lists. For purposes of counterbalancing 
the order of presentation of the two members of a contrast pair as 
well as the position of the queried member, four lists were required. 
Four tapes, each, representing a different counterbalancing version of 
the master list, were recorded by the experimenter and administered 
to each participant over four successive school days. Thus, each 
child responded to each real-word ^contrast four times and to each 
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nonsense-syllable contrast four tises- Two scores were tabulated 
for each subject — percent error in respons.e to real-word contrasts 
and percent error in response to nonsense-syllable contrasts. 



Table 3 

Sixteen !Iininal Consonant Contrasts Eaployed in '-the Present 
Study in Real-word and Nonsense-syllable Foms - 



Contrast 


Konsense 


i 

Fora 


Real-word Fom 


pt 




ta 


pie " 


tie 


^- re . " 


' fa . 


r^a 


fin 


thin 


nn 




na 


fT^eat 


neat 


pk 


pa 


ka 


pan 


csn 


si 


■ sa 




see 


she' 


nq 


an 




rxu 


ring \ 


pt) ■ 


pa 


ta 


pig ' 


big 


td 




-da 




do 


*g 


ka 




came 


game 




fa 


va 


fairy 


very 




ea 




tbigh 


thy 


SZ 


sa 


28 


sue 


zoo 




, Sa 


J** 


choke,. 


joke 




sa 


ea 


sihg^ ' 


-thing 


4* 


23 


03 


zees 


these 


tk 


ta 


ka 


take 


cake 



Resuxts / 

The 10 children who responded to the experimental tapes showed 
a mean error rate of 34.5% for real word items and 47% for nonsense 
syllable items- Table 4 is a list of error rates for each child as 
a function of the two stimulus types. No child showed a reversal . 
of the trend to do better in -esponse to real-word items, although 
S2 and S^IO showed nearly equivalent performance in response to both 
item types. ,A two-way analysis of variance (Item Type (2) x Subjects 
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Tabic- 

I!ean Error Races for E 
a Function of Two Stii:u 




In percent"^ 

(10)) showed a significant^ difference between responses to the two 
item types (see Table 5). It should be noted that overall perfor- 
raance on nonsense-syllable pairs was at a level probably equivalent 
to chance (nean = 47% error rate). " * 



Table 5 ^ - 

Source Table for 2-Way A?:OVA 
(Item Typ'e X Subjects) 



Source of 
Variation 


df 


. Mean 
Sqdares 


F-RATI0 


Item Type 


1 


288.7 


43.08* 


Subjects . 


9 


. 23.3 ■ 




Residual 


9 


6.7 













*p<.01. 
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Experi-aeat 2B: Five-Year-Olds 

in Mst respects this exp'ericent was identical to Expericent -2A 
•Hie differences involve ninor *nodifications^ in list cons truest ion, 
number of test days, and tjie age of the children- tested. ^ 



Method 

Participants . Twenty^pupils from a 5-year-old class at a public 

school were administered the test. All children were monolingual 

Anglos with no known hearing loss. The experimental procedure was 

the sane as in the first experiment. 
* • * * 

' Stimuli ^ Test items were the saraFT? consdnaiiF^on^^ as. in 

Experiment 2A. Two separate lists were constructed. One list con- 
sisted of only the real-word contrasts and the other consisted of the 
nonsense-syllable contrasts. Each list contained 32 iteds since each 
item appeared~in both A^B^A and*A-B-B form. The position of the quer: 
item was not counterbalanced in tfixs experiment. There were two days 
of testing. Ten participants (Group A) received the real-word list 
and 10 (Group B) received the nonsense-syllable lists on the first 
day. On the second day each group received the opposite list. 



Results . . , • 

The children tested exhibited a mean error ratS of 21.2% in 
response to real-word contrasts and 48:7% in response to nonsense- 
syllable contrasts. /As shown in Table 6, this differential was 
influenced by the reversed order of presentation between the two 

Table 6 



5lean Error Rates for^ EacIv of Two eiroups-^'as S'Fanction of Days 







Day 2 


Group A 


21-2% '-.\ 
(real<>wDrds) 


45.6% . 
(nonsense syllables) 


Group B 


51.8% 
(nonsense syllables) 


21.2% 
(real words) 

— — «j — 
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groups- For the purposes of the analysis of variance, the data vere 
tabulated in terzis'uf two factors, Gr<:?ups . (A and B) and Days (1 and 2) 
Tims/ differential error rates for real-vord and nonsense-syllable 
contrasts vill appear is 'a signiVitant j?roup :< days interaction. The 
two-way analysis of variance ccnf imed a si>;nificant Groap x Days^ 
interaction '(F (1,18) = 17.80, p<.01)i 'As in Experiiaent 1, it should 
be noted that perfornanre in response to, noasense-syllafile contrasts 
was equivalent to chance (45-6 and 51.8 percent, nean error rates) igr 
both groups. 



Discussion . - . ' ^ 

The results of Experinents 2A and 2b clearly demonstrate dif- 
ferential error potentials between nonsense-syllab\e and real-word , 
contrasts when preschool children are tested. For all practical 
purposes, the i:hlldren tested in these experiments were unable to 
discriminate between the nenbers of nuUi^ense-sy liable pairs. Yet 
this failure cannot be 'accounted for on the basis of the' particular 
ziininal. sound contrasts presented bee iu.se the same contrasts comprised 
the real-word teijts. . *Cor is it reasonable to argue that the nonsense 
stiiiuli* require sound discrimination «while the real-word Stimuli 
require only a discrimination' of nearilngs. 'Success on either type of 
stinulus pairs requires that the sounds be discriainated. While mean- 
ing is a factor in one task and pot in the^ other, dif f erenrt meanings 
are a function of different sounds and accurate sound perception is 
prerequisite to accurate" recognition of the vords themselves. 

The experinents conparing discrdUninability of speech sounds in _^ 

thu context of real-word paiVs and di^>criminability of the same speech 
sounds J.n nohs^nse-syllabl^i ^airs bear on the question of the form o| 
:^tored linguistic data in tne cognitive system of the 'preschool child. 
Two nodes ^f stored data can be considered. Either the child has 
access to speech sounds as a function of stored word entries, or he 
has access to speech sounds per se. Such speech sound concepts can 
be conceived of as cognitive representations of speech sounds which 
underlie the organization of phonological data in memory. Word con- 
cepts can be assumed to be prior in origin to speech sound concepts. 

Consider the auditory discrimination task in which contrasts are 
presented in the form of real-word pairs. Psychological^ processes 
already established, i.e., normal word-recognition processes, allow 
the child to successfully differentiate such pairs, j^ord concepts 
serve to mediate the required discrimination. Nonsense-syllable 
pairs, on the other hancl, may or may not relate to previously developed 
processing skills.' If. the nonsense syllables are.' perceived as words, 
the same* prbcesses as discussed for real-word stimuli come into play. 
If meaningless syllaSles are perceived as such, the system either 
breaks, down or is equipped to deal with such syllc^bles on the basis 
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of speech sound concepts. Successful discriaination in the case of 
nonsense-syllabie stimuli turuL on the system's capacity to recognize 
the onset of one syllable "-^ " ound x" and the onset of the other 
syllable as "sound y" (or, in uone tasks, non-x). Like word concepts, 
speech sound concepts mediate discrinir^ation of contrasting- pairs 
when stimuli are input in the form of meaningless speech' sounds. This 
view of two independent inventories of functional perceptual units is 
not ndcessary but does represent a sufficient theoretical framework 
to account for the behavior observed in the experiments at issue. 

* ' * ' 'r u 

The 'development of" speech sound concepts can,dQCOunt loj other 

linguistic behaviors observed in r^earch on the acquisi-tionrofTiliono- 
logical competence. Phonemic segmentation, sounding-out, sound blend- 
ing; and* other word analysis tasks can all be seen to relate to the 
acquisition of the speech sound as a functional perceptual unit. While 
speech sound ^concepts are not a necessary aspect of normal phonological 
development, they appear to play a role, for example, in the acquisition 
of phonic reading skills. Speech sound concepts may, in fiict, account 
for corre'lations reported between poor performance in early reading 
and poor performance- oif auditory discrimination or other word analysis 
tasks (Dfirreil & Murphy, 1^53; Mepman, 1960; Dykstra, 1966; McNeil & 
Coleman, 1967). 

Claims were made -in the reports surveyed earlier tfhat illegitimate ; 
m^sures of Auditory dj^crimit^acion would be obtained through the use - . 
of paired-word tests due to an "unforeseen advantage" for those children 
familiar with the words in the test. It can be countered that the 
ability to differentiate minimal ghonemic differences can legitimately 

be related to paired^word tests or paired-syllable testis depending on 

what notion of auditory discrimination abilitff one adopts. If an 
investigator merely' intends to assess the child's ability to differ- 
entiate the functional units' of speech, why should it matter that 
'meaningful word entries serve to mediate the distinctions which pernaps 
only have reality as a function of the lexicon? If , ^on the other- hand, 
an investigator intends to differentiate among chiddren on the basis 
of 'cognitive functions that result from linguistic experience, he may 
crfoose to verify the formation of speech sound concepts by testing the 
child's ability to process speech sound data as such, i.e., in the 
form of "meaningless syllables. * 

To what extent traditional auditory discrimination tests involve 
linguistic processes is unknown dnd, unhappily, unquestioned. The 
experiment demons trii ting differential discrimination potentials between 
real-word' -pairs und nonsense pairs suggests that for a test to reflect 
.linguistic skill, it must respect the cognitive framework in which ' i 
linguistic data is organized. While nonsense syllables posing as 
words have been used extensively with apparent success in psycholinguistic 
research, little is known about the young child's ability to deal with 
nonsense syllables per se. 
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While it is not disputed that traditional auditory discrini-nation 
procedures involve several linguistic levels and a network of psycho- 
logical processes, it has apparently been assumed that these factors 
cannot adversely^ affeci; t^f scores* The studies discussed in this 
paper decsonstrate that traditional auditory discrimination tests do 
not represent straightforward or reliable indices of speech ^ound 
discrininability. ' Low scores on the tests cannot be equated with 
iiiadequately developed speech sound discrimination. Alternatives to 
esta'blished procedures for assessing speech sound perception are 
clearly required- Ttiere a ppears t o^J^f* no r»^r»c:nn^A3hy tasks *thar reflect 
^rAy the llnguTsTIc slauLls at issue are precluded* For example, 
presently used tasks involve a natch or misnatch between succes- 
sively presented rf^^yllables. Yet .Jaisperceptions of ^the sort that ^re **, 
likely to affect speech comprehension involve a mismatch between 
utterance and cognitive representation. A test modelled on matching- 
^to-memory is not only more relevant but also avoids the short-term 
memory problems inherent in same-different and even A-B-X procedures. 
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app^:dix a 



ALL POSSIBLE COrCSQX.m OPPOSITIONS CATECORlZEh BV THt DlSTiyCTIVK 
FEATURES ON imiOl THEY CONTRAST 
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tare differences 
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